Clustering with phylogenetic tools in astrophysics
نویسنده
چکیده
Phylogenetic approaches are finding more and more applications outside the field of biology. Astrophysics is no exception since an overwhelming amount of multivariate data has appeared in the last twenty years or so. In particular, the diversification of galaxies throughout the evolution of the Universe quite naturally invokes phylogenetic approaches. We have demonstrated that Maximum Parsimony brings useful astrophysical results, and we now proceed toward the analyses of large datasets for galaxies. In this talk I present how we solve the major difficulties for this goal: the choice of the parameters, their discretization, and the analysis of a high number of objects with an unsupervised NP-hard classification technique like cladistics.
منابع مشابه
Visualizing Restricted Landscapes of Phylogenetic Trees
We are designing tools to visualize very large sets of phylogenetic trees. Our tools give a three dimesional representation of treespace, with two dimensions representing the clustering of trees under multidimensional scaling, and the third dimension (the “height”) the score of the tree (i.e. parsimony or maximum likelihood score). The user can rotate the resulting distribution to get a sense o...
متن کاملTreeOTU: Operational Taxonomic Unit Classification Based on Phylogenetic Trees
Our current understanding of the taxonomic and phylogenetic diversity of cellular organisms, especially the bacteria and archaea, is mostly based upon studies of sequences of the smallsubunit rRNAs (ssu-rRNAs). To address the limitation of ssu-rRNA as a phylogenetic marker, such as copy number variation among organisms and complications introduced by horizontal gene transfer, convergent evoluti...
متن کاملSignal processing approaches as novel tools for the clustering of N-acetyl-β-D-glucosaminidases
Nowadays, the clustering of proteins and enzymes in particular, are one of the most popular topics in bioinformatics. Increasing number of chitinase genes from different organisms and their sequences have beenidentified. So far, various mathematical algorithms for the clustering of chitinase genes have been used butmost of them seem to be confusing and sometimes insufficient. In the...
متن کاملPhylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
متن کاملPhylogenetic clustering and overdispersion in bacterial communities.
Very little is known about the structure of microbial communities, despite their abundance and importance to ecosystem processes. Recent work suggests that bacterial biodiversity might exhibit patterns similar to those of plants and animals. However, relative to our knowledge about the diversity of macro-organisms, we know little about patterns of relatedness in free-living bacterial communitie...
متن کامل